On Tue, May 14, 2013 at 6:17 PM, Hans Hagen wrote:
On 5/14/2013 6:07 PM, luigi scarso wrote:
I Hope that someone can help here
as Mojca mentioned thai at bachotex i'll add the patterns as a start
given specs, examples and time, adding support for thai to context shouldn't
be too hard
On 5/15/2013 4:09 PM, Mojca Miklavec wrote:
On Tue, May 14, 2013 at 6:17 PM, Hans Hagen wrote:
On 5/14/2013 6:07 PM, luigi scarso wrote:
I Hope that someone can help here
as Mojca mentioned thai at bachotex i'll add the patterns as a start
given specs, examples and time, adding support
On Wed, May 15, 2013 at 5:20 PM, Hans Hagen pra...@wxs.nl wrote:
But ... Luigi is already teaching himself Thai, so ...
no no, just connecting people on different ml.
Currently I'm in a completely different area
--
luigi
: suppose that a motivated Thai programmer
would be willing to work on solving the problem properly. What would
be the suggested solution?
You can post also in the context ml, maybe there is some Thai user there
.
I am a Thai developer who works on Thai word segmentation tools and
thailatex
On 5/14/2013 6:07 PM, luigi scarso wrote:
I Hope that someone can help here
as Mojca mentioned thai at bachotex i'll add the patterns as a start
given specs, examples and time, adding support for thai to context
shouldn't be too hard (assuming that there are users)
Hans
(\number#2,#1)}}
+% \def\dolanguagecharacters#1#2{\ctxlua{converters.alphabetic(#2,#1)}}
% this does not belong here, but in a lang-module
-\def\thainumerals #1{\ctxlua{converters.alphabetic(\number#1,thai)}}
-\def\devanagarinumerals#1{\ctxlua{converters.alphabetic(\number#1,devanagari)}}
-\def
are valid.
This open the door to similar questions (cfr core-con.lua,core-con.tex for
persian,thai etc)
and given that Unicode sooner or later will cover all kind of writing
systems of the human race,
I expect that some day some questions will be mixed with maya numerals.
--
luigi
)
% for historical reasons / compatibility
\defineregimesynonym[windows][cp1252]
% 5 - Cyrillic
% 6 - Arabic (not supported)
% 7 - Greek
% 8 - Hebrew (3 signs missing)
% 11 - Thai (not supported)
\defineregimesynonym[il1][iso-8859-1]
\defineregimesynonym[il2][iso-8859-2]
\defineregimesynonym[il3
- Thai (not supported)
\defineregimesynonym[il1][iso-8859-1]
\defineregimesynonym[il2][iso-8859-2]
\defineregimesynonym[il3][iso-8859-3]
\defineregimesynonym[il4][iso-8859-4]
\defineregimesynonym[il5][iso-8859-9]
\defineregimesynonym[il6][iso-8859-10]
\defineregimesynonym[il7][iso-8859-13
. Not supported: Hebrew, Arabic, Vietnamese? for cp125X
and Arabic, Thai and Celtic for iso-8859-X.)
I'll send the files (full content is already on my page), but I need
to know how to split/group them (I guess it would be a bad idea to
have one file for each encoding). Should there be one file for
iso-8859
with
(I'll send the good files soon.)
Except Celtic, Thai, Arabic and Hebrew (although the letter names for
Hebrew are almost completely defined) almost all the windows and ISO
regimes are OK, just some glyphs are missing (which are, or at least
were, missing in Unicode vectors as well). If anyone has
to ask for core support for windows-1250
(perhaps other users may find some other regimes useful as well).
just send me the files you feel confident with
(I'll send the good files soon.)
Except Celtic, Thai, Arabic and Hebrew (although the letter names for
Hebrew are almost completely defined
(s),
please let me know.
IV. With the help of the prepared names list I processed definitions for
regimes (taken from Unicode webpage) for ISO-8859-* and cp125* (others
should be trivial). They are only preliminary, some (Hebrew, Thai,
Arabic) probably don't make any sense yet, but could
is that?
ISO-8859-9 Turkish
ISO-8859-10 Nordic
ISO-8859-11 Thai
ISO-8859-13 Baltic
ISO-8859-14 Celtic
ISO-8859-15 Western
ISO-8859-16 Romanian
\defineregimesynonym[il*][iso-8859-*], *=1-16\12
\defineregimesynonym[latin*][iso-8859-*], *=1-16\12
\defineregimesynonym
14 matches
Mail list logo